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[Title of Invention] 
NUCLEIC ACID METHYLATION DETECTION PROCESS USING AN INTERNAL REFERENCE S 
AMPLE 

[Claims] 

1. A process for detecting methyl at ion at large numbers of CpG island si 
tes simultaneously using a reference sample obtained from the sample to 
be tested, comprising: 

(a) providing a sample of DNA for analysis; 

(b) dividing the DNA sample into a first DNA sample and a second DNA sam 
pie, whereby the first sample will become a test sample and the second s 
ample will become an internal reference sample; 

(c) amplifying the second DNA sample by a nucleic acid amplification pro 
cess such that methyl cytosine residues are amplified as unmethylated cyt 
osine residues; 

(d) bisulfite converting the amplified first DNA sample and the second D 
NA sample to convert unmethylated cytosine residues to deoxyuracil resid 
ues in both samples; 

(e) amplifying the converted first DNA sample and the converted second D 
NA sample; 

(f) labeling the bisulf ite-converted second DNA sample with a second flu 
orescent marker and the bisulf ite-converted first DNA sample with a firs 
t fluorescent marker, wherein the first and second fluorescent markers h 
ave non-overlapping fluorescent excitation and emission spectra; and 

(g) hybridizing the first DNA sample and the second DNA sample onto a mi 
croarray device having a plurality of oligonucleotide capture probes des 
igned to hybridize to CpG island sites of the DNA sample as converted an 
d non-converted by bisulfite. 

2. The process of claim 1, wherein the amplification technique employed 
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^^^is PCR (polymerase chain reaction). 

3. The process of claim 1 or 2, wherein the hybridization conditions are 
high stringency. 

4. The process of any one of claims 1 to 3, wherein the non-overlapping 
fluorescent labels are Cy3, (1,1*- bis ( c -carboxypentyl) -l'ethyl-3, 3,3 
\3'-tetramethylindocarbocyanine-5,5'-disulfonate potassium salt di-N-hy 
droxysuccinimide ester) and Cy5 (1, l'-bis( e -carboxypentyl )-l' ethyl-3, 3, 

3',3'-tetramethylindodicarbocyanine-5,5'-disulfonate potassium salt di-N 
-hydroxysucc in im i de ester) . 

5. A microarray device for using for the process of any one of claims 1 
to 4, having a plurality of oligonucleotide capture probes designed to h 
ybridize to CpG island sites of the DNA sample as converted and non-conv 
erted by bisulfite. 

6. A kit for the process of any one of claims 1 to 4, comprising 

(a) The microarray device of claim 5, 

(b) Bisulfite converting reagents and DNA labeling reagents. 



[Detailed Description of Invention] 

[Field of Invention] 

The present invention provides a process for detection of DNA methyl 
ation at CpG sites using nucleic acid arrays and preferably microarrays. 

Specifically, the present invention provides a process for directly gen 
erating a reference sample from the sample to be tested and detecting me 
thylation at large numbers of CpG island sites simultaneously. Specifica 
lly, the inventive process comprises dividing a DNA sample into two samp 
les (a first sample and a second sample), amplifying the first DNA sampl 
e by a nucleic acid amplification process such that any methyl cytosine r 
esidues are amplified as unmethylated cytosine residues, treating the am 
plified first sample and the (unamplif ied) second sample with bisulfite 
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^^^to convert unmethylated cytosine residues in both samples to deoxyuracil 

residues, labeling the bisulf ite-converted second sample with a second 
fluorescent marker and the bisulf ite-converted first sample with a first 

fluorescent marker, wherein the first and second fluorescent markers ha 
ve non-overlapping fluorescent excitation and emission spectra; and hybr 
idizing the first sample and the second sample onto a microarray device 
having a plurality of oligonucleotide capture probes designed to hybridi 
ze to CpG island sites of the DNA sample as converted and non-converted 
by bisulfite. 

[Prior Art] 
Methvlation Assay Processes 

Methylation of cytosines (C) in the 5' position of the pyrimidine ri 
ng has been shown to be an important epigenetic determinant if a cell or 

t issue sample is cancerous. In animals, methyl cytosine is mainly found 
in cytosine-guanine (CpG) dinucleotides, whereas in plants it is most of 
ten found in cytosine-any base-guanine (CpNpG) trinucleotide sequences. 

Methylation of C residues in genomic DNA plays a key role in regulat 
ion of gene expression (Wolffe et al., Proc. Natl. Acad. Sci. USA 96:589 
4-5896, 1999) because the presence of 5-me thy 1 cytosine in the promoter o 
f specific genes alters the binding of transcriptional factors and other 
promoters to DNA (Costello and Plass, J. Med. Genet. 38:285-503, 2001). 
Further, 5-methyl cytosine in the promoter of specific genes also attrac 
ts methyl-DNA binding proteins and histone deacetylases that modify chro 
matin structure around the gene transcription site. Both effects result 
in blocking transcription and cause gene silencing (Bird, Nature 321:209 
-213, 1986). 

Generally, levels of methylcystine occurrence in genomic DNA have be 
en measured using two different general processes, including processes e 
mploying high-performance separation techniques or by enzymatic/chemical 
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means. In order to perfect large scale screening techniques, the enzyma 
tic/chemical means are preferred because they do not require expensive a 
nd complex analytical equipment. However, the enzymatic/chemical techniq 
ues have not been as sensitive as high-performance separation techniques 

and the resolution is often restricted to endonuclease cleavage sites. 

Two alternative approaches have been tried for DNA methylation detec 
tion, bisulfite methods and non-bisulfate methods. Non-bisulfate methods 

use methylat ion-sensitive restriction endonucleases combined with South 
em blot analysis or PCR detection, but often results are limited to cle 
avage sites. Bisulfite modification of DNA allows for quantitative deter 
mination of methylation status of an allele and requires PCR amplificati 
on of bisuifate-modif ied DNA. Differences in methylcytosine patterns are 

displayed by methylat ion-dependent primer designs (i.e., methylat ion-sp 
ecific PCR) in conjunction with methylat ion-sensitive restriction endonu 
cleases, genomic sequencing or other approaches. 

Bisulfite treatment of DNA converts unmethylated cytosine to uracil, 

while methylated cytosine does not react (Furuichi et al., Biochem. Bio 
phys. Res. Commun. 41:1185-1191, 1970). Bisulfate modification of genomi 
c DNA requires prior DNA denaturation because only methyl cytosines that 
are located in single strands are susceptible to attack (Shapiro et al., 
J. Am. Chem. Soc. 96:206-212, 1974). However, there are problems associ 
ated with bisulfite treatment, including, for example, only partial dena 
turation (Rein et al., J. Biol. Chem. 272:10021-10029, 1997), renaturati 
on problems in high salt concentrations, and incomplete desulfonation af 
ter bisulfate treatment (Thomassin et al., Methods 19:465-475, 1999). Mo 
reover, the total conversion of cytosines to uracils is critical to the 
analysis, so temperature, time and pH conditions are critical without de 
stroying the integrity of the DNA material. 

In bisulfite modification methylation detection processes, the most 
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straightforward way of measuring methylation at CpG islands is by sequen 
cing. However, sequencing techniques are also the most difficult (time c 
onsuming and expensive) and do not allow for multiplexing of large numbe 
rs of scattered CpG island sites in genomic DNA samples. In general, aft 
er denaturation and bisulfite modification of a genomic DNA sample, the 
resulting dsDNA is obtained by primer extension and the fragment of inte 
rest is amplified by PCR techniques (Clark et ai., Nucl. Acids Res. 22:2 
990-2997, 1994). Standard DNA sequencing of the PCR products then detect 
s Methylcytosine. Alternatively, one could clone the PCR products into p 
lasmid vectors followed by sequencing of individual clones for a slowed 
method but one that could also provide methylation maps of single DNA mo 
lecules. In another variation, direct localization of methylcytosines in 
the product of bisulfite treatment instead of the PCR product can be do 
ne using only three deoxynucleo tides (dATP, dCTP and dTTP) but lacking d 
GTP that produces an elongation stop at methylcytosine points (Radlinska 
and Skowronek, Acta Microbiol. Pol. 47:327-334, 1998). 

Another process in the bisulfite class is methylat ion-specific PCR ( 
Esteller et al., Cancer Res. 61:3225-3229, 2001; and Herman et al., Proc 
. Natl. Acad. Sci. USA 93:9821-9826, 1996), also called MSP. In normal ( 
non-cancerous) cells, cytosines in CpG islands are usually unmethylated, 
but they become methylated in the promoter sequences of genes associate 
d with certain abnormal cellular processes, such as cancer (Esteller et 
al., Cancer Res. 59:793-797, 1999; Esteller et al., Cancer Res. 61:3225- 
3229, 2001; and Esteller et al., Hum. Mol. Genet. 10:3001-3007, 2001). B 
isulfite-converted DNA strands are no longer complementary, so primer de 
sign in MSP is customized for each chain and methylation patterns of all 
sequences determined in separate reactions. MSP uses a difficult PCR pr 
ocess and critical primer designs using a narrow range of strand anneal i 
ng temperatures, the PCT product is between 80 and 175 base pairs, each 
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^^^^primer should contain at least two CpG pairs, the sense pair should cont 
ain a CpG pair at the 3' end and primers contain non-CpG cytosines. The 
MSP technique requires PCR and if the PCR goes for too many cycles of am 
plification without ensuring that the reaction is in the lineal response 
range with respect to template concentration, then large over-estimatio 
ns of the extent of methylation can be obtained if the sequence is ampli 
fiable with both the methyl at ion-specific primers and the primers for un 
methylated sequences. 

The MSP method was improved by combining methylat ion-specific PCR wi 
th in situ hybridization (Nuovo et al., Proc. Natl. Acad. Sci. USA 96:12 
754-12759, 1999) to allow for the methylation status of specific DNA seq 
uences to be visualized in individual cells, for monitoring complex tiss 
ue samples having both tumor and normal cells. Another method combines M 
SP with denaturing HPLC to allow for small cell mosaics of structurally 
normal or abnormal chromosomes to be detected (Baumer et al., Hum. Mutat 
. 17:423-430, 2001). Specifically, following PCR amplification, the two 
alleles can be resolved from the two populations of PCR products by dena 
turing HPLC because they differ at several positions within the amplifie 
d sequence. 

Another quantification approach has been called MethyLight and uses 
fluorescent-based, real-time PCR (U.S. Patent 6,331,393 the disclosure o 
f which is incorporated by reference herein; and Eads et al., Nucleic Ac 
ids Res. 28:E32, 2000). The DNA is modified by the bisulfite treatment a 
nd amplified by fluorescence-based, real-time quantitative PCR using loc 
us-specific PCR primers that flank an oligonucleotide probe with a 5' fl 
uorescence reporter dye and a 3' quencher dye. The reporter is enzymatic 
ally released during the reaction, and fluorescence, which is proportion 
al to the amount of PCR product and thus to the degree of methylation, c 
an be sequentially detected in an automated nucleotide sequencer device. 
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While fluorescence increases the sensitivity of this process, the proce 
ss is difficult, requires expensive instrumentation and consumables and 
cannot be multiplexed to detected hundreds or thousands of CpG island si 
tes simultaneously. 

Another approach has been to combine methyl-sensitive endonucleases 
with PCR amplification with subsequent hybridization to oligonucleotide 
microarrays (Huang et al., Hum. Mol. Genetics, 8:459-70, 1999). In this 
case, methylation state was determined by digestion of unmethylated DNA 
using methylation sensitive restriction enzyme. Unmethylated DNA was enz 
ymatically digested into fragments and did not generate ampl icons after 
PCR whereas methylated DNA was protected from digestion and did generate 

ampl icons after PCR. The presence or absence of ampl icons was detected 
on oligonucleotide microarrays using fluorescent tags. Samples from norm 
al tissues were used as a control with the supposition that these non-ca 
ncerous samples contained predominantly unmethylated cytosine residues. 
This procedure requires DNA from non-cancerous tissue to be available fo 
r use as an external control. Additionally, the exact methylation state 
of the external control needs to be ascertained before it can be confide 
ntly used to interpret results from a dual-hybridization assay. 

Another approach has been to perform a dual -hybridization assay usin 
g a test sample and an external reference sample known to be unmethylate 
d in the analyzed region (Balog et al., Anal Biochem. 309: 301-310, 2002 
). In this case, a 190-bp DNA duplex was synthesized and used as an exte 
rnal reference sample, or DNA was obtained from a sample known to be unm 
ethylated. The two samples were labeled with different fluorescent dyes, 
mixed and hybridized to an array containing 21mer oligonucleotides. The 
external reference sample generated signal in a reference fluorescent c 
hannel on capture probes hybridizing to a thymidine residue. The presenc 
e of signal on a capture molecule probing for the presence of C within t 
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^^^he test sample indicated methylation of that C residue. 

Therefore, there are a variety of methylation detection processes th 
at have advantages and disadvantages, but none have the ability to deter 
mine the methylation state of a large number of CpG islands without the 
presence of an external reference sample. . Therefore, there is a need i 
n the art to incorporate processes that do not require an external refer 
ence sample yet are able to multiplex DNA methylation assays to simultan 
eously determine methylation patterns. 
DNA Microarravs 

In the world of microarrays or biochips, biological molecules (e.g., 
oligonucleotides, polypeptides, oligopeptides and the like) are placed 
onto surfaces at defined locations for potential binding with target sam 
pies of nucleotides or receptors or other molecules. Microarrays are min 
iaturized arrays of biomolecules available or being developed on a varie 
ty of platforms. Much of the initial focus for these microarrays have be 
en in genomics with an emphasis on cellular gene expression, single nucl 
eotide polymorphisms (SNPs) and genomic DNA detection/validation, functi 
onal genomics and proteomics (Wilgenbus and Lichter, J. Mol. Med. 77:761 
, 1999; Ashfari et al., Cancer Res. 59:4759, 1999; Kurian et al., J. Pat 
hoi. 187:267, 1999; Hacia, Nature Genetics 21 suppl. :42, 1999; Hacia et 
al., Mol. Psychiatry 3:483, 1998; and Johnson, Curr. Biol. 26:R171, 1998 
)• 

There are, in general, three categories of microarrays (also "DNA Ar 
rays" and "Gene Chips" but this descriptive name has been attempted to b 
e a trademark) having oligonucleotide content. Most often, the oligonucl 
eotide microarrays have a solid surface, usually silicon-based and most 
often a glass microscopic slide. Oligonucleotide microarrays are often m 
ade by different techniques, including (1) "spotting" by depositing sing 
le nucleotides for in situ synthesis or completed oligonucleotides by ph 
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ysical means (ink jet printing and the like), (2) photolithographic tech 
niques for in situ oligonucleotide synthesis (see, for example, Fodor U. 
S. Patent 5,445,934 and the additional patents that claim priority from 
this priority document, (3) electrochemical in situ synthesis based upon 
pH based removal of blocking chemical functional groups (see, for examp 
le, Montgomery U.S. Patent 6,093,302 the disclosure of which is incorpor 
ated by reference herein and Southern U.S. Patent 5,667,667), and (4) el 
ectric field attraction/repulsion of fully-formed oligonucleotides (see, 
for example, Hollis et al., U.S. Patent 5,653,939 and its duplicate Hel 
ler U.S. Patent 5,929,208). Only the first three basic techniques can fo 
rm oligonucleotides in situ, which are, building each oligonucleotide, n 
ucleotide-by-nucleotide, on the microarray surface without placing or at 
tract ing fully formed oligonucleotides. 

The electrochemistry platform (Montgomery U.S. Patent 6,093,302, the 
disclosure of which is incorporated by reference herein) provides a mic 
roarray based upon a semiconductor chip platform having a plurality of m 
icroelectrodes. This chip design uses Complimentary Metal Oxide Semicond 
uctor (CMOS) technology to create high-density arrays of microelectrodes 
with parallel addressing for selecting and controlling individual micro 
electrodes within the array. The electrodes turned on with current flow 
generate electrochemical reagents (particularly acidic protons) to alter 
the pH in a small, defined "virtual flask" region or volume adjacent to 
the electrode. The microarray is coated with a porous matrix for a reac 
tion layer material. Thickness and porosity of the material is carefully 
controlled and biomolecules are synthesized within volumes of the porou 
s matrix whose pH has been altered through controlled diffusion of proto 
ns generated electrochemical ly and whose diffusion is limited by diffusi 
on coefficients and the buffering capacities of solutions. 

The microarrays that are made with oligonucleotide capture probes ar 
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e generally spotted onto glass slides. However, the glass slides are not 
well suited for creating a reaction chamber with the capture probes tha 
t form the spots as the hybridization reaction of target nucleic acids w 
ith the capture probes is long and involves controlled conditions. There 
fore, there is a need in the art to create better reaction chambers that 
allow for control of hybridization conditions including stringency cond 
it ions (e.g., temperature, gas pressures, chemical environment and pH). 
[Problems to be solved by Invention] 

In view of the many processes that have advantages and drawbacks for 
quantitative methylation determination, there is a need in the art for 
being able to multiplex many different sites or CpG islands for methylat 
ion analysis simultaneously and in parallel, preferably using existing D 
NA microarray technology. The present invention was made to develop a me 
thylation process adapted to DNA microarrays to take advantage of the mu 
ltiplex capabilities of DNA microarrays for methylation analysis. 
[Means to solve problems] 

The present invention provides a process for detecting methylation a 
t large numbers of CpG island sites simultaneously using a reference sam 
pie obtained from the sample to be tested, comprising: 

(a) providing a sample of DNA for analysis; 

(b) dividing the DNA sample a first DNA sample and a second DNA sample, 
whereby the first sample will become a test sample and the second sample 

will become an internal reference sample; 

(c) amplifying the second DNA sample by a nucleic acid amplification pro 
cess such that methylcytosine residues are amplified as unmethylated cyt 
osine residues; 

(d) bisulfite converting the amplified first DNA sample and the second D 
NA sample to convert unmethylated cytosine residues to deoxyuracil resid 
ues in both samples; 
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^^^^(e) amplifying the converted first DNA sample and the converted second D 
NA sample; 

(f) labeling the bisulf i te-converted second DNA sample with a second flu 
orescent marker and the bisulf i te-converted first DNA sample with a firs 
t fluorescent marker, wherein the first and second fluorescent markers h 
ave non-overlapping fluorescent excitation and emission spectra; and 

(g) hybridizing the first DNA sample and the second DNA sample onto a mi 
croarray device having a plurality of oligonucleotide capture probes des 
igned to hybridize to CpG island sites of the DNA sample as converted an 
d non-converted by bisulfite. 

Preferably, the amplification technique employed is PCR (polymerase 
chain reaction). Preferably, the hybridization conditions are high strin 
gency. Preferably, the non-overlapping fluorescent labels are Cy3, (1,1' 
-bis( e -carboxypentyl) -1' ethyl-3, 3, 3' , 3' -tetramethyl indocarbocyanine-5, 5 
'-disulfonate potassium salt di-N-hydroxysuccinimide ester) and Cy5 (1,1 
' -bis( e -carboxypentyl ) -1 9 ethy 1-3, 3, 3' , 3* -tetramethyl indodicarbocyanine- 
5, 5' -di sulfonate potassium salt di-N-hydroxysuccinimide ester); or the n 
on-overlapping fluorescent labels are Alexa Fluor 594 and Alexa Flour 54 
6; or the non-overlapping fluorescent labels are Fluorescene and Texas R 
ed. 

The present invention also provides a microarray device for using th 
e process of above-mentioned invention, having a plurality of oligonucle 
otide capture probes designed to hybridize to CpG island sites of the DN 
A sample as converted and non-converted by bisulfite, and a kit for the 
process of above-mentioned invention. Preferably, the kit comprises the 
microarray device, bisulfite converting reagents and DNA labeling reagen 
ts. 

[Mode for carrying out Invention] 

The present invention essentially provides a process wherein the met 
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^^^hylation state of cytosine residues within CpG islands is determined by 
analyzing the signal intensities at defined positions on a microarray de 
vice. On a microarray device, each position or site comprises a differen 
t capture probe oligonucleotide sequence. Therefore, multiple target mol 
ecules can be captured in a multiplex fashion, limited only by the numbe 
r of capture probe sites available on a microarray device. Those microar 
ray devices developed by CombiMatrix Corporation and marketed through Ro 
che Diagnostics (matriXarray™) for example, can contain up to about 13, 
000 different sites or an ability to develop a single assay on one chip 
to evaluate methylation at over 13,000 CpG islands simultaneously. 

A minimum of two positions is required on a microarray device to ide 
ntify the methylation state of each cytosine residue. For example, a sam 
pie containing a methylated (M) cytosine residue generates a target mole 
cule that contains a cytosine residue at a specific position, whereas a 
sample containing an unmethylated (U) cytosine residue generates a targe 
t molecule that contains a uracil residue at the specific position. For 
example, 

5' c 3' target molecule from methylated (M) samp 

le 

5' u 3' target molecule from unmethylated (U) sa 

mple. 

Each of these target molecules is captured at a different position o 
n the microarray device using a capture oligonucleotide probe with a com 
plementary sequence. High stringency conditions during hybridization and 
wash steps permit a specific capture of a perfectly matched molecule wi 
th a specific capture probe, with no capture or minimal capture of molec 
ules that contain a single-base mismatch between the target molecule and 
oligonucleotide capture probe. 
For example, 
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3' capture oligonucleotide probe for met 



hylated (M) sample 



— 3' capture oligonucleotide probe for unm 



ethyl ated (U) sample 

A methylated (M) sample generates a target molecule containing a cyt 
osine residue at the original cytosine position. Its complementary oligo 
nucleotide capture probe on the microarray device containing a guanosine 
residue captures this target. An unmethylated (U) sample generates a ta 
rget molecule containing uracil at the original cytosine position. Its c 
omplementary oligonucleotide capture probe on the microarray device, whe 
rein the oligo capture probe contains an adenosine residue at the corres 
ponding site, captures this sample. 

In a preferred embodiment of microarray hybridization assays, the ta 
rget molecules/samples are labeled with a fluorescent dye to produce a f 
luorescent signal that is detected by an optical detection instrument. A 
lternative means for detection of binding or hybridization includes vari 
ous electrochemical detection schemes wherein the bound target molecule/ 
oligonucleotide capture probe complex generates an electrode or an elect 
ric charge detectable by a nearby electrode. Thus, the methylation state 
of an unknown sample (test sample) can be determined by measuring bindi 
ng/hybridization (e.g., the fluorescent signal) at the methylated positi 
on (M) or unmethylated position (U) at different known locations on the 
microarray device. 

A single microarray device can contain tens, hundreds or thousands o 
f sites, each with a different capture probe oligonucleotide sequence mo 
lecule. The methylation state of tens, hundreds or thousands of CpG isla 
nds can be determined on a single microarray at one time. However, since 
determination of the methylation state of hundreds or thousands of CpG 
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island positions is performed at one time, non-specific or artifact even 
ts may interfere with robust determination of methylation state at each 
relevant position. The present inventive process significantly or comple 
tely eliminates the probability of obtaining false positives. This is ac 
hieved by incorporation an internal reference sample into the assay. 

A reference sample, also known as a control sample, is a nucleic aci 
d sample whose methylation state is known. Existing protocols for multip 
lex determination of methylation state on arrays require the availabilit 
y of a separate sample for use as a control or reference (Huang et al., 
Hum. Mol. Genetics 8:459-70, 1999; Balog et ah, Anal Biochem. 309:301-3 
10, 2002). The reference sample is obtained from normal tissue adjacent 
to a tumor tissue, for example. The methylation state of the reference s 
ample is independently determined before it can be used as a reference o 
r control. Alternatively, a reference sample can be produced by chemical 

synthesis of DNA representing the region being studied. In another prot 
ocol for multiplex determination of methylation state a reference or con 
trol sample is not used (Adorjan et al., Nucleic Acid Res. 30(5) :e2l). I 
n this case, the signal intensities from the methylated probe sequence a 
re compared to the signal intensity of the unmethylated probe sequence. 
However, non-specific or artifact events may interfere with robust ratio 

determination of methylation state at each relevant position. In a pref 
erred embodiment of the present invention a DNA sample is analyzed in on 
e fluorescent channel while the same DNA sample is used as a reference s 
ample in another fluorescent channel. The reference sample is prepared f 
rom the original sample such that any methylation in the original sample 

is removed to produce a reference sample that is used as an internal ne 
gative control. The only sample required for this embodiment is the DNA 
sample being tested. No other DNA is required, such as synthetically gen 
erated reference DNA or DNA from non-cancerous tissue. 
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For example, an unknown sample may contain both methylated and unmet 
hylated cytosine residues within CpG islands. When the unknown sample is 
used as a template in a polymerase chain reaction (PCR) the resultant a 
mplicon contains only unmethylated cytosine. This is due to the fact amp 
lification of the template incorporates unmethylated dCTP that is mixed 
into the polymerase reaction. The product of this reaction is used as a 
negative internal control in any hybridization assay. 

An unknown sample (test) and a known sample (reference) are mixed to 
gether and allowed to hybridize to the microarray device. Since both tes 
t and reference samples may hybridize to the same capture probe sequence 
s at a particular site on the microarray device, it will be impossible t 
o determine how much of the signal originated from the test sample and h 
ow much of the signal originated from the reference sample, irrespective 
of the choice of hybridization detection means employed. According to t 
he present inventive process, the test and reference samples are labeled 
with two different fluorescent dyes so that the signal from each source 
can be measured separately using wavelength-specific detection of f luor 
escence. Therefore, using two different fluorescent colors in a 2-color 
assay. In this manner, the signal intensity of the test sample is measur 
ed by detection in a reader channel to look for the first fluorescent dy 
e, and the signal intensity of the reference sample is measured by the d 
etection in a reader channel to look for the second fluorescent dye, whe 
rein the first and second fluorescent dye do not have overlapping emissi 
on and excitation wavelengths. 

The reference samples can be prepared in a number of different ways 
for presentation to a microarray device to measure hybridization. For ex 
ample, a starting material for this methylation assay is genomic DNA. Th 
is material is isolated and purified from tissues or cells using a numbe 
r of existing methods. Purified genomic DNA is prepared for microarray h 
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ybridization following the scheme shown in Figure 2. Roughly equal amoun 
ts of genomic DNA are placed in two separate tubes, one for reference sa 
mple preparation and one for test sample preparation. 

The reference sample is prepared by an initial PCR step (PCR1) using 

forward (Fi) and reverse (Ri) primers that are designed to anneal to th 
e template DNA at a position outside of the CpG island being tested to f 
orm an amp 1 icon. The amp 1 icon that is produced can have a length of appr 
oximately 50 base pairs or 500 base pairs to even 1000 base pairs. The a 
mplicon is purified and treated with sodium bisulfite using standard pro 
tocols. The treatment converts cytosine residues to deoxyuracil residues 

since cytosine residues in the reference sample are unmethylated (U) af 
ter the first PCR step (PCR1). Deoxyuracil residues behave as thymidine 
residues in subsequent enzymatic and annealing reactions. One method for 

the sodium bisulfite conversion step for a methylation assay follows a 
procedure described in Frommer et al., PNAS 89, 1827-1831. One method is 

to (1) Dilute DNA (up to 2 fig) with dH20 to 50 ul (the amount of DNA t 

0 be methylated per reaction should be kept constant); (2) Add 5.5 /ul 2 
M NaOH; (3) Incubate at 37 ° C for 10 min to denature DNA; (4) Add 30 ju 

1 of lOmM hydroquinone (prepare by adding 55 mg to 50mL dH20) ; (5) Add 5 
20 ix 1 freshly prepared 3M sodium bisulfite (prepared by adding 1.88 g s 
odium bisulfite to 5 mL dH20 - adjust pH to 5.0 with NaOH); (6) Mix well 
, incubate at about 50 ° C for 16 hours; and (7) Clean up DNA (e.g., Qia 
gen or Promega kit or a reverse phase column e.g., 3M Empore Disk cartri 
dges 4240). If a reverse phase column is used, (a) Add 450 microliters o 
f 10 mM Triethanolamine, 1 mM EDTA, 0. 1M Tris pH 7.7; (b) Wash twice wit 
h 750 microliters of 10 mM Triethanolamine, 1 mM EDTA, 0. 1M Tris pH 7.7; 

(c) Elute with 50/50 methanol/water + 0.3M NaOH; and (d) Speedvac until 
dry. In addition, continue the process by (8) Resuspend recovered DNA i 
n 50 ju 1 dH20, add 5.5 /x\ of 3M NaOH, incubate at room temperature for 
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5 min; (9) ETOH precipitate DNA, using a carrier such as glycogen; and ( 
10) store the DNA like RNA (i.e., keep cold, minimize freeze thaws, stor 
e at -20 ° C). 

The converted DNA can now be amplified by PCR. It is important to no 
te that the 2 strands of converted DNA are no longer complimentary, so o 
ne has to decide which strand (sense or antisense) to amplify. Primers a 
re designed to amplify fully converted DNA (i.e., all C residues are now 
T). It should be noted that theoretically, the amount of DNA to be meth 
ylated per reaction should be kept constant. Moreover, the bisulfite con 
version process is not that efficient as one has often measured 60-80% c 
onversion of unmethylated C's. Further, controls using sss-1 methylase ( 
NEB) can be generated to estimate conversion efficiencies. 

The converted genomic DNA product is used as a template in a second 
PCR step (PCR2) using forward (F2) and (R2) primers. Since the sodium bi 
sulfite conversion (SBC) step generated thymidine residues at every cyto 
sine position, F2 and R2 primers are designed to anneal to a template th 
at contains thymidine at every cytosine position. The forward primer als 
o contains an RNA polymerase promoter sequence at the 5' end for T7 poly 
merase. The purified amplicon from PCR2 is used in an in vitro transcrip 
tion reaction to generate single-stranded RNA molecules suitable for hyb 
ridization to the microarray. Fluorescent dyes, such as Cy3 (1, l'-bis(e 
-carboxypentyl)-l' ethyl-3, 3, 3' , 3'-tetramethyl indocarbocyanine-5, 5' -disul 
fonate potassium salt di-N-hydroxysuccinimide ester) or Cy5 (l,l'-bis(e 
-carboxypentyl) -1' ethyl-3, 3, 3' , 3' -tetramethyl indodicarbocyanine-5, 5' -dis 
ulfonate potassium salt di-N-hydroxysuccinimide ester) (Amersham) are in 
corporated into the product during transcription. In this example, Cy3 U 
TP (Amersham Cat# PA53026) is used to fluorescent ly label the reference 
target RNA. In the reference sample, all cytosine residues, including me 
thylated (M) and unmethylated (U) cytosine, convert to thymidine during 
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bisulfite conversion. The Cy3 labeled reference target RNA, therefore, c 
ontains uracil residues at every cytosine position in the original start 
ing material. 

The test sample does not undergo an initial PCR step as does the ref 
erence sample. Instead, the test sample DNA is treated directly with sod 
ium bisulfite to convert unmethylated (U) cytosine residues to thymidine 
. Methylated (M) cytosine residues are not converted and retain their cy 
tosine structure. The resulting product is used as a template in a PCR s 
tep (PCR2) using forward (F2) and reverse (R2) primers. The primers are 
designed to anneal to the template DNA at a position outside of the CpG 
island being tested. The amplicon that is produced can have a length of 
approximately 50 base pairs or 500 base pairs or even 1000 base pairs. 

The same PCR primers are used for the reference sample and the test 
sample in PCR2. In a preferred embodiment, the forward primer contains a 
n RNA polymerase promoter sequence at the 5' end for T7 polymerase. 
Different PCR primers pairs are required for amplification of each regio 
n that is being queried. When several CpG islands are in close proximity 
, the same pair of PCR primers is used to encompass all CpG islands in t 
he amplicon. When CpG islands are not in close proximity, separate PCR p 
rimer pairs are used for each CpG island being queried. When PCR amplifi 
cation is performed for multiple sites, amplification reactions can be d 
one in a multiplex fashion by combining multiple sets of PCR primers int 
o one reaction mixture yielding multiple sets of ampl icons from differen 
t regions of the DNA template. PCR primers for multiplex reactions are d 
esigned by accurately predicting primer hybridization, evaluating templa 
te secondary structure, selecting matching primer pairs, and identifying 
non-specific primer binding sites. Products from many multiplex reactio 
ns are combined to generate pools of ampl icons for tens, hundreds or eve 
n thousands of methylation sites. 
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The purified amplicon from PCR2 is used in an in vitro transcription 

reaction to generate single-stranded RNA molecules. The single-stranded 

RNA molecules are suitable for hybridization to the microarray. Fluores 

cent dyes, such as Cy3 (l, l'-bis( e -carboxypentyl)-l'ethyl-3,3,3' ,3'-tet 

ramethylindocarbocyanine-5,5'-disulfonate potassium salt di-N-hydroxysuc 

cinimide ester) or Cy5 (1, l'-bis( e -carboxypentyl)-l'ethyl-3, 3, 3' , 3' -tet 

ramethylindodicarbocyanine-5,5'-disulfonate potassium salt di-N-hydroxys 

uccinimide ester) are incorporated into the product during transcription 

. In this example, Cy5 UTP (Amersham Cat# PA55026) is used to fluorescen 

tly label the test target RNA. Therefore, in the test sample, all unmeth 

ylated (U) cytosine residues convert to thymidine during bisulfite conve 

rsion. Methylated (M) cytosine residues do not convert and retain their 

structure. The first fluorescent dye-labeled single stranded test target 

RNA contains uracil residues at every cytosine position in the original 

starting material except for cytosine positions that were methylated. I 

n that case, cytosine residues are present in the test target RNA at eve 

ry cytosine position in the original starting material. 

5 ' c 3' target molecule from methylated (M) samp 

le 

5 ' u 3* target molecule from unmethylated (U) sa 

mple 

The interaction between the target and the oligonucleotide capture p 
robe on the microarray device is able to discriminate between a perfect- 
match hybrid and single-mismatch hybrid by controlling hybridization con 
ditions. For example, highly stringent hybridization conditions to discr 
iminate between single base pair mismatches would be as follows: a test 
sample is prepared in 50 ul 6X SSPE, 5X Denhardt's reagent (Sigma), 0.05 
% Tween 20 and hybridized with a microarray device at 50 ° C for 6 hours 
- The microarray is washed with 300 fx 1 of 6X SSC and 0.05% Tween 20 at 
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50 ° C, followed by 300 M 1 of 2X SSC at 48 ° C, followed by 300 ju 1 of 
IX SSC and finally 300 ju 1 0.5X SSC both at room temperature. 

A perfect-match hybrid generates a fluorescent signal when imaged wi 
th a fluorescent microarray optical detection device (such as those manu 
factured by Axon Instruments, Agilent, Applied Precision and others). Th 
e single-mismatch hybrid is thermodynamically unstable and does not form 
a stable hybrid. Therefore, no fluorescent signal is generated at that 
position on the microarray device. 

A single base difference between a methylated and unmethylated sampl 
e is identified on the microarray by the presence or absence of signal a 
t positions containing the complementary sequence to each target molecul 
e in solution. Since the intensity of the fluorescent signal at each pos 
it ion reflects the amount of material in each sample, the state of methy 
lation at each CpG island is determined by measuring the fluorescent sig 
nal at a methylation (M) or unmethy lation (U) position on the microarray 

The reference sample produces signal at the unemethylated (U) positi 
on on the microarray in a second fluorescent probe detection channel. Th 
is serves as internal negative control and increases the reliability of 
results obtained from this assay. If the signal in the first fluorescent 
probe detection channel (i.e., test sample) is similar to the signal pa 
ttern in the second fluorescent probe channel (i.e., reference sample), 
the test sample is unmethylated at the cytosine position within the CpG 
island of interest. Conversely, if the signal in the first fluorescent p 
robe detection channel (i.e., test sample) is different from the signal 
in the second fluorescent probe channel (i.e., reference sample), the te 
st sample is methylated at the cytosine position within the CpG island o 
f interest. 
Microarray Ttesipn 
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In a test sample, all or most unmethylated (U) cytosine residues con 
vert to thymidine during bisulfite conversion. Methylated (M) cytosine r 
esidues do not convert and retain their structure. The first fluorescent 
labeled test target RNA contains uracil residues at every cytosine posi 
tion in the original starting material except for cytosine positions tha 
t were methylated. In that case, cytosine residues are present in the te 
st target RNA at every cytosine position in the original starting mater i 
al. 

5 ' c 3' target molecule from methylated (M) samp 

le 

5 ' u 3' target molecule from unmethylated (U) sa 

mple 

The interaction between the target and the capture probe on the micr 
oarray is able to discriminate between a perfect-match hybrid and single 
-mismatch hybrid under appropriate high stringency hybridization conditi 
ons. Appropriate high stringency hybridization conditions include, for e 
xample, hybridization in 50 M l 6X SSPE, 5X Denhardt's reagent (Sigma), 
0.05% Tween 20 at 50 ° C for 6 hours, followed by washing with 300 ^ 1 o 
f 6X SSC and 0.05% Tween 20 at 50 ° C, followed by washing with 300 ju 1 
of 2X SSC at 48 ° C, followed by washing with 300 fx 1 of IX SSC and fina 
lly with 300 M 1 0.5X SSC both at room temperature. 

A more preferable high stringent condition is conducting hybridizati 
on at 55° C in the solution above instead of 50° C. However, other than 
temperature, several factors, such as salt concentration, can influence 
the stringency of hybridization and one skilled in the art can suitably 
select the factors to accomplish a similar stringency. 

The perfect-match hybrid on the microarray generates a fluorescent s 
ignal when imaged with an optical detection device. Alternatively other 
common microarray detection technologies can be used, such as electroche 
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mical detection on microarray devices have electrodes and electronic sig 
nal hybridization detection technologies. The single-mismatch hybrid is 
thermodynamically unstable and does not form a stable hybrid. Therefore, 
no detectable signal is generated at that position on the microarray de 
vice. 

The microarray is designed to have at least one capture probe for th 
e methylated target (M) and one capture for the unmethylated target (U). 
Example sequences are shown below: 
Methylated Sample Capture Probe Sequence (M) 

5'-uauuuuuuuagguagcggguaguaguuguuu-3' target sequence [SEQ ID NO. 1] 
3'-auaaaaaaauccaucgcccaucaucaacaaa-5' capture probe sequence (3' -5') [SE 
Q ID NO. 2] 

5'-aaacaacuacuacccgcuaccuaaaaaaaua-3' capture probe sequence (5' -3') [SE 
Q ID NO. 3] 

Unmethylated Sample Capture Probe Sequence (U) 

5'-uauuuuuuuagguaguggguaguaguuguuu-3' target sequence [SEQ ID NO. 4] 
3'-auaaaaaaauccauca£ccaucaucaacaaa-5' capture probe sequence (3* -5') [SE 
Q ID NO. 5] 

5'-aaacaacuacuaccca£uaccuaaaaaaaua-3' capture probe sequence (5' -3') [SE 
Q ID NO. 6] 

Multiple methylation assays, are performed at the same time on one mi 
croarray. Here, methylation assays '1 through — ' are performed in paral 
lei. Each methylation assay has its own pair of methylated (M) and unmet 
hylated (U) capture probes on the microarray. 

Since the reference sample undergoes an initial PCR step, only unmet 
hylated cytosine is present in the DNA amplicon prior to transcription. 
The reference sample represents signal from an unmethylated source. The 
test sample is not treated with an initial PCR step and the methylation 
state of the cytosine residues is retained prior to sodium bisulfite con 
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version. The methylation state is determined by comparing the signal int 
ensities of the test sample in a second fluorescent probe channel to the 
test sample in a first fluorescent probe channel. If there is equal sig 
nal intensity in both the first and second channels the test sample is u 
nmethylated at the specific CpG island of interest. If there is signal i 
ntensity in the first fluorescent probe channel but not in the second fl 
uorescent probe channel, the test sample is methylated (see Figure 4). 

Figure 5 shows results from a multiplexed two-color methylation dete 
ction assay performed on a microarray. Multiple methylation sites (CpG i 
slands) are detected at one time on the same microarray (1 to •••). The ' 
M' position contains capture probe sequences for a methylated sample. Th 
e 'M' capture probe contains a guanosine residue at the cytosine posit io 
n of the original cytosine in CpG island and adenosine residues at all o 
ther cytosine positions. The 'U' position contains capture probe sequenc 
es for an unmethylated sample. The 'U' capture probe contains adenosine 
residues at all cytosine positions in the original sample. 

In the data illustrated in Figure 4, the second fluorescent probe si 
gnals obtained from the reference sample represent the pattern obtained 
by an unmethylated sample. Very little or no signal is detected at the M 
ethylation probe (M) and a large amount of signal is detected at the Unm 
ethylation probe (U). This pattern establishes the reference signal for 
an unmethylated sample. The first fluorescent probe signals represent th 
e methylation state of the test sample. The microarray determines the me 
thylation state of hundreds or thousands of CpG islands in a multiplex f 
ashion. The signal at position 1 shows the results from CpG island numbe 
r 1. The signal at position 2 shows the results from CpG island number 2 
, and so on. The signal at position 1 displays strong fluorescent signal 
at the Unmethylation probe (U) and very little or no signal at the Meth 
ylation probe (M). This pattern is similar to the signal pattern in the 
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reference sample for position 1. The interpretation is made that CpG isl 
and number 1 is unmethylated (U). 

Further, in Figure 4, the signal at position 2 displays strong fluor 
escent signal at both the Unmethylation probe (U) and the Methylation pr 
obe (M). This pattern is different from the signal pattern in the refere 
nee sample for position 2. Since both Methylation and Unmethylation prob 
es generate fluorescent signal in the test sample, the interpretation is 
made that CpG island number 2 has both a methylated and unmethylated al 
lele (M/U). The signal at position 3 displays strong fluorescent signal 
at the Methylation probe (M) and very little or no signal at the Unmethy 
lation probe (U). This pattern is different to the pattern in the refere 
nee sample for position 3. The interpretation is made that CpG island nu 
mber 3 is methylated (M). 

By following the interpretation logic illustrated with the data show 
n in Figure 4, the methylation state of hundreds or thousand of CpG isla 
nds can be determined on a single microarray at one time. Once the fluor 
escent signal is obtained, the process of interpretation, or calling the 
methylation state, can be performed using suitable computers and softwa 
re algorithms. This permits rapid interpretation of assay results with t 
he need for human intervention. 
Capture P robe Libra ry Screening 

Methylation of cytosine residues generally occur within CpG dinucleo 
tide positions. Computer algorithms that scan through known regions of t 
he genome predict sites where methylation of cytosine may occur (http:// 
www.uscnorris.com/cpgislands, Takai and Jones, Proc. Natl. Acad. Sci. 19 
; 99 (6): 3740-5, 2002). Additionally, computer databases store DNA methyl a 
tion sites and allow searching and retrieval of DNA sequences around the 
se sites (http://www.methdb.net/). 

The microarrays are composed of hundreds or thousands of different s 
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equence oligonucleotide capture probes to specifically capture target mo 
lecules. The target molecules contain either a cytosine residue at the o 
riginal methylation site if the sample is methylated, or a uracil residu 
e if the sample is unmethylated. 

5> c 3> target molecule from methylated (M) samp 

le 

3 ' S 5' capture probe for methylated (M) sample 

5 ' u 3' target molecule from unmethylated (U) sa 

mple 

3 ' a 5' capture probe for unmethylated (U) sampl 

e 

Specific hybridization of the target molecule to its complementary c 
apture probe is required for robust assay performance. Hybridization con 
ditions are designed so that a single base mismatch between a capture pr 
obe and target molecule does not form a stable hybrid. For example, hybr 
idization is performed in 50 ul 6X SSPE, 5X Denhardt's reagent (Sigma), 
0.05% Tween 20 at 50 ° C for 6 hours. The array is washed with 300 ft I o 
f 6X SSC and 0.05% Tween 20 at 50 ° C, followed by 300 ft 1 of 2X SSC at 
48 ° C, followed by 300 ft 1 of IX SSC and finally 300 ft I 0.5X SSC both 
at room temperature. 

It is also highly desirable to screen thorough a large number of cap 
ture probes at one time. High throughput screening of tens or hundreds o 
f different capture probes against the same target permits rapid and cos 
t-effective development of validated probes sets. 

A single-base mismatch placed a different positions along a capture 
probe can have significant impact on the performance of the capture prob 
e and in its ability to discriminate between a perfectly matched or sing 
le base mismatched target. It is desirable to rapidly and effectively sc 
reen a library of hundreds or thousands of different capture probes to i 
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dentify the most reliable sequence. For example, a library of sequences 
is generated by moving the position of the mismatch sequence (methylatio 
n position) along the capture probe sequence. 
Capture probe for methylated (M) sample: 
5'__ g 3 . 

5 » g 3. 

5' g 3. 

5' g 3' 

5' g 3 ' 

5> g 3. 

5' g 3. 

5' g 3 ' 

5' g 3 ' 

5' g ___3' 

In addition, a library of sequences is generated by increasing the 1 
ength of the capture probe sequence. 
Capture probe for methylated (M) sample: 
5' g 3* 

5' g 3* 

5' g 3' 

5' g 3' 

5' g 3' 

5' g 3 ' 

5' g 3 » 

5' g 3 ' 

5' g 3. 

By combining the probe length and mismatch position, hundreds or eve 
n thousands of different probes are designed for a single CpG island pos 
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it ion. The semiconductor based microarray system rapidly synthesizes all 
probes at one time and the entire library is empirically tested by a sc 
reening assay. Only probes with desired performance are selected and use 
d in the final assay. 



Example 1 

This example illustrates a multiplex methylation assay using the inv 
entive procedure and a microarray device from CombiMatrix Corp. (made us 

ing in situ synthesis with an electrochemical process). 
Sample Collectio n and DNA Purif icaHon 

Homo sapiens DNA mismatch repair (hMLHl) gene (GenBank ACCESSION U83 
845), a human primary colon carcinoma, cell line SW480 (purchased from A 
TCC; http://www.atcc.org/), and 293T (purchased from ATCC) were used. DN 
A was purified as follows. The cells were cultivated, and were collected 

from dishes. The collected cells were washed in PBS three times. The wa 
shed cells were centrifugated, and stored at -80 ° C. Cell aggregates we 
re suspended in reaction buffer, and were digested by proteinase K. Afte 
r digestion, the aqueous phases were extracted with a 1:1 mixture of equ 
ilibrated phenol and chloroform. DNA was recovered by 70% ethanol precip 
i tat ion, and was suspended in pure water or TE (l.Omg/mL). 
Sequence being investigated: 5' atcacctcagcag agg r.ar a r. aag rrr gg f t^ gg ^ atctc 

tgctcctattggctggatatttcgtattccccgagctcctaaaaacgaaccaataggaagagcggacagcga 
tctctaaegcgcaagcgcatatcct tctaggtag£gggcagtagccgct tcagggagggacgaagagaccca 

gcaacccacagagttgagaaatttgactggcattcaagctgtccaatcaatagctgccgctgaagggtgggg 

ctggatggcgtaagctacagctgaaggaagaacgtgagcacgaggcactgaggtgattggctgaaggcactt 
ccgttgagcatctagacgtttccttggctr.ttri-p g rpr^ ?; ,^ a f g -c{' [ SEQ ID N0 y -j 

Genomic DNA (100 ng) was aliquoted into two tubes, one labeled reference 
sample and the other labeled test sample. 
Test Sample Preparation 
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The Reference sample undergoes a PCR amplification with the following t 
wo primers: 

Forward primer 1 (Fl) : 5'-atcacctcagcagaggcacac-3' [SEQ ID NO. 8] 
Reverse primer 1 (Rl) : 5' -tttggcgccagaagagccaag-3' [SEQ ID NO. 9] 
PCR amplification was performed in a total volume of 50 M 1 containing 1 
X PCR Gold(R) buffer (Applied Biosystems, Foster City, CA), 2 mM MgCl2, 
0.2 mM deoxynucleotide triphosphates mixture (USB), 10 pmol forward prim 
er (Fl, SEQ ID NO. 8), 10 pmol reverse primer (Rl, SEQ ID NO. 9), 2 U Am 
plitaq Gold DNA(R) polymerase (ABI) , and 100 ng template DNA. Reaction 
conditions were as follows: 95 ° C for 10 minutes, and 39 cycles of 92 
° C for 30 seconds, 57 ° C for 30 seconds, and 70 ° C for 30 seconds, wi 
th a final elongation for 7 minutes at 70 ° C. The PCR products were ana 
lyzed by gel electrophoresis using a 2.5% agarose gel, stained with ethi 
dium bromide and visualized under UV illumination with a digital imaging 
system (NucleoTech) . Amplicons were purified using QIAquick(R) PCR puri 
ficat ion kits (Qiagen) following manufacturers protocol. 
Sodium bisulfite Aversion (SRf) 

One microgram purified amplicon was diluted in 50 M l of distilled w 
ater and denatured by addition of 1 ^ 1 10 M sodium hydroxide to a final 
concentration of 0.2 M and incubated for 10 minutes at 37 ° C. After in 
cubation, 30 M 1 10 mM hydorquinone (Sigma) and 520 M 1 of 3M sodium bis 
ulfite (Sigma) at pH 5.0 were added. The solution was incubated at 53 ° 
C for 18-20 hours. 

DNA was purified by QIAquick(R) purification kits (Qiagen) following 
the manufacturer's protocol. The DNA was desulfonated with 0.3 M sodium 
hydroxide for 10 minutes at room temperature, neutralized with 17 /ul o 

f 10 M ammonium acetate (Ambion) and then precipitated in 100% ethanol a 

t -80 ° C overnight. 

The second PCR amplification of the reference sample used the follow 



mfl# 2004-300631 1 



^12 0 0 2-3 7 5 1 0 2 

ing 

Forward primer 2 (F2) : 5' -taatacgactcactatagggattattttagtagaggtatat-3' [ 
SEQ ID NO. 10] 

Reverse primer 1 (R2) : 5'-tttggtgttagaagagttaag-3' [SEQ ID NO. 11]. 

Amplification was performed in a total volume of 50 ju 1 containing 1 
X PCR Gold(R) buffer (Applied Biosystems, Foster City, CA), 2 mM MgCl2, 
0.2 mM deoxynucleotide triphosphates mixture (USB), 10 pmol forward prim 
er 2 (F2, SEQ ID NO. 10), 10 pmol reverse primer (R2, SEQ ID NO. 11), 2 
U Amplitaq Gold DNA(R) polymerase (Applied Biosystems, Foster City, CA) 
, and 100 ng template DNA. Reaction conditions were as follows: 95 ° C f 
or 10 minutes, and 39 cycles of 92 ° C for 30 seconds, 57 ° C for 30 sec 
onds, and 70 ° C for 30 seconds, with a final elongation for 7 minutes a 
t 70 ° C. The PCR products were analyzed by gel electrophoresis using a 
2.5% agarose gel, stained with ethidium bromide and visualized under UV 
illumination with a digital imaging system (NucleoTech) . Amplicons were 
purified using QIAquick(R) PCR purification kits (Qiagen) following manu 
facturers protocol. 
Transcription 

One microgram purified amplicon containing T7 promoter sequence was 
transcribed in vitro in a total volume of 20 fx 1 using MEGAscript(R) Kit 
s (Ambion) following the manufacturers protocol with the addition of 5 
/j. 1 of 10 mM Cy3 UTP (Amersham). The transcripts were purified using RNe 
asy(R) purification kits (Qiagen) following the manufacturer's protocol. 

Test Samnlfi Preparation 

The test sample is first converted in a sodium bisulfite conversion 
(SBC) step. Briefly, one microgram genomic DNA was diluted in 50 fx 1 of 
distilled water and denatured by addition of 1 ^ 1 10 M sodium hydroxide 
to a final concentration of 0.2 M and incubated for 10 minutes at 37 ° 
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C. After incubation, 30 /u 1 10 mM hydorquinone (Sigma) and 520 /u 1 of 3M 
sodium bisulfite (Sigma) at pH 5.0 were added. Hie solution was incubat 
ed at 53 ° C for 18-20 hours. 

The DNA was desulfonated with 0.3 M sodium hydroxide for 10 minutes 
at room temperature, neutralized with 17 /x 1 of 10 M ammonium acetate (A 
mbion) and then precipitated in 100% ethanol at -80 ° C overnight. 

The test sample genomic DNA was then amplified by a PCR 2 amplificat 
ion using the following primers: 

Forward primer 2 (F2) : 5' -taatacgactcactatagggattattttagtagaggtatat-3' [ 
SEQ ID NO. 12] 

Reverse primer 1 (R2) : 5'-tttggtgttagaagagttaag-3' [SEQ ID NO. 13] 

PCR amplification was performed in a total volume of 50 /xl containi 
ng IX PCR Gold buffer (Applied Biosystems, Foster City, CA), 2 mM MgCl2, 
0. 2 mM deoxynucleotide triphosphates mixture (USB), 10 pmol forward pri 
mer 2 (F2), 10 pmol reverse primer (R2), 2 U Amplitaq Gold DNA polymeras 
e (Applied Biosystems, Foster City, CA), and 100 ng template DNA. React i 
on conditions were as follows: 95 ° C for 10 minutes, and 39 cycles of 9 
2 ° C for 30 seconds, 57 ° C for 30 seconds, and 70 ° C for 30 seconds, 
with a final elongation for 7 minutes at 70 ° C. The PCR products were a 
nalyzed by gel electrophoresis using a 2.5% agarose gel, stained with et 
hidium bromide and visualized under UV illumination with a digital imagi 
ng system (NucleoTech) . Ampl icons were purified using QIAquick PCR purif 
ication kits (Qiagen) following manufacturer's protocol. 

The test sample was then subject to transcription. Briefly, one micr 
ogram of purified ampl icon containing T7 promoter sequence was transcrib 
ed in vitro in a total volume of 20 /ul using MEGAscript Kits(R) (Ambion 
) following the manufacturers protocol with the addition of 5 n 1 of 10 
mM Cy5 UTP (Amersham). The transcripts were purified using RNeasy(R) pur 
if ication kits (Qiagen) following the manufacturer's protocol. 
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Hybridization and Wash 

The reference transcript (4 fig) was labeled with a second fluoresce 
nt dye, preferably Cy3 and combined with the test transcript (4 fig) lab 
eled with a first fluorescent dye, preferably Cy5 in 50 ul 6X SSPE, 5X D 
enhardt's reagent (Sigma), 0.05% Tween 20. The mixture was hybridized wi 
th a microarray device at 50 ° C for 6 hours. The microarray was washed 
with 300 ft 1 of 6X SSC and 0.05% Tween 20 at 50 ° C, followed by 300 ft 1 
of 2X SSC at 48 ° C, followed by 300 ft 1 of IX SSC and finally 300 ft 1 
0.5X SSC both at room temperature. 
Microarray Device 

The microarray is designed to have at least one capture probe for th 
e methylated target (M) and one capture for the unmethylated target (U). 

Methylated Sample Capture Probe Sequence (M) 

5'-uauuuuuuuagguagcggguaguaguuguuu-3' target sequence [SEQ ID NO. 1] 
3'-auaaaaaaauccaucgcccaucaucaacaaa-5' capture probe sequence (3' -5') [SE 
Q ID NO. 2] 

5'-aaacaacuacuacccgcuaccuaaaaaaaua-3' capture probe sequence (5' -3') [SE 
Q ID NO. 3] 

Unmethylated Sample Capture Probe Sequence (U) 

5'-uauuuuuuuagguaguggguaguaguuguuu-3' target sequence [SEQ ID NO. 4] 
3'-auaaaaaaauccaucacccaucaucaacaaa-5' capture probe sequence (3' -5') [SE 
Q ID NO. 5] 

5'-aaacaacuacuacccac.uaccuaaaaaaaua-3' capture probe sequence (5' -3') [SE 
Q ID NO. 6] 

Imaging a nd Data Analysis 

After the final wash step, the microarray was imaged using an optica 
1 detection instrument having a CCD camera (arrayfoRx Biochip Reader, Ap 
plied Precision). Two images were captured from each microarray correspo 
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nding to the emission wavelength of each fluorescent dye. The images wer 
e saved on a microcomputer and analyzed following manufacturer's instruc 
tions (softWoRx Tracker, Applied Precision). The fluorescent intensity a 
t each position on the microarray was quantified and saved as a spreadsh 
eet containing probe sequence and position information as well as fluore 
scent intensity of each dye. 

Intensity data was analyzed, for example, by calculating the ratio o 
f signal intensity between the test sample having the first fluorescent 
dye and the reference sample having the second fluorescent dye. In this 
manner, the methylation state of cytosine residues within a CpG island i 
s determined. 

Ratio analysis of data for probe signal and reference signal: 



This example performs the analysis of multiple CpG islands simultane 
ously using the procedure described in Example 1 for a single CpG island 

methylation determination. Signal intensities shown in Figure 4 are ana 
lyzed by calculating the ratios for each probe in both the test and refe 
rence signal channels. For example, the first row of the microarray show 
n in Figure 4 is designed to assess the methylation state of region 1 in 

a sample. The second row is designed to assess the methylation state in 

region 2 of the same sample, and so on. Each region in the sample may b 
e methylated (M) or unmethylated (U). The sample is prepared as describe 

in Example 1. The sample being tested is used as its own internal refer 
ence control. Preparation of the reference sample removes any methylatio 
n that may have been present. The reference target is labeled with Cy3 f 



Example 2 
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luorescent dye, for example, and its signal appears in the Cy3 detection 
channel. The test sample is processed so that methylation is retained d 
uring preparation. The test sample is labeled with Cy5 fluorescent dye, 
for example, and its signal appears in the Cy5 detection channel. Figure 
4 shows signal intensities appearing in each channel in black. 

The results for region 1 are shown in the first row of the microarra 
y. The reference sample appears with fluorescent signal at the unmethyla 
ted px-obe (U) position as expected. The test sample also appears with si 
gnal at the unmethylated probe (U) position, similar to the reference sa 
mple. By calculating these results using the formula provided in Example 
1, the interpretation is made that the sample is unmethylated (U) in re 
gion 1. 

The results for region 2 are shown in the second row of the microarr 
ay. The reference sample appears with fluorescent signal at the unmethyl 
ated probe (U) position as expected. The test sample appears with signal 
at both the unmethylated probe (U) position and the methylated (M) prob 
e position. By calculating these results using the formula provided in E 
xample 1, the interpretation is made that the sample contains both methy 
lated (M) and unmethylated (U) cytosine in region 2 in approximate equal 
proportions. This occurs when, for example, one allele in the sample is 
methylated and the other allele is unmethylated. This may also occur in 
a heterogeneous cell population where approximately one half of the eel 
Is are methylated in region 1. 

The results for region 3 are shown in the third row of the microarra 
y. The reference sample appears with fluorescent signal at the unmethyla 
ted probe (U) position as expected. The test sample appears with signal 
only at the methylated probe (M) position. By calculating these results 
using the formula provided in Example 1, the interpretation is made that 
the sample is entirely methylated (M) in region 3. 
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Microarrays containing tens, or hundreds, or even thousands position 
s can be used to determine the methylation state of tens, hundreds or ev 
en thousands of different CpG island regions within the same sample and 
in parallel. 
[Sequence Listing] 

SEQUENCE LISTING 

<110> National Institute of Advanced Industrial Science and Technology 

CombiMatrix K.K. 
<120> Nucleic Acid Methylation Detection Using Arrays 

<130> A3-A0207 

<160> 13 

<170> Patent In version 3. 1 

<210> 1 

<211> 31 

<212> RNA 

<213> Artificial 

<220> 

<223> an artificially synthesized target sequence 
<400> 1 

uauuuuuuua gguagcgggu aguaguuguu u 31 
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<210> 2 

<211> 31 

<212> RNA 

<213> Artificial 

<220> 

<223> an artificially synthesized capture probe sequence 

<400> 2 

aaacaacuac uacccgcuac cuaaaaaaau a 31 

<210> 3 

<211> 31 

<212> RNA 

<213> Artificial 

<220> 

<223> an artificially synthesized capture probe sequence 



<400> 



3 



aaacaacuac uacccgcuac cuaaaaaaau a 



31 



<210> 4 



<211> 31 



<212> RNA 



<213> Artificial 
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<220> 



<223> an artificially synthesized target sequence 



<400> 4 



uauuuuuuua gguagugggu aguaguuguu u 



31 



<210> 5 

<211> 31 

<212> RNA 

<213> Artificial 

<220> 

<223> an artificially synthesized capture probe sequence 

<400> 5 

aaacaacuac uacccacuac cuaaaaaaau a 31 

<210> 6 

<211> 31 

<212> RNA 

<213> Artificial 



<220> 



<223> 



an artificially synthesized capture probe sequence 



<400> 



6 



aaacaacuac uacccacuac cuaaaaaaau a 



31 
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<210> 7 

<211> 375 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized sequence 
<400> 7 

atcacctcag cagaggcaca caagcccggt tccggcatct ctgctcctat tggctggata 60 

tttcgtattc cccgagctcc taaaaacgaa ccaataggaa gagcggacag cgatctctaa 120 

cgcgcaagcg catatccttc taggtagcgg gcagtagccg cttcagggag ggacgaagag 180 

acccagcaac ccacagagtt gagaaatttg actggcattc aagctgtcca atcaatagct 240 

gccgctgaag ggtggggctg gatggcgtaa gctacagctg aaggaagaac gtgagcacga 300 

ggcactgagg tgattggctg aaggcacttc cgttgagcat ctagacgttt ccttggctct 360 

tctggcgcca aaatg 375 



<210> 8 
<211> 21 
<212> DNA 
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<213> Artificial 



<220> 



<223> an artificially synthesized primer sequence 



<400> 8 



atcacctcag cagaggcaca c 



21 



<210> 9 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized primer sequence 

<400> 9 

tttggcgcca gaagagccaa g 21 

<210> 10 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized primer sequence 
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<400> 10 



taatacgact cactataggg attattttag tagaggtata t 



41 



<210> 11 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized primer sequence 

<400> 11 

tttggtgtta gaagagttaa g 21 

<210> 12 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized primer sequence 



<400> 12 



taatacgact cactataggg attattttag tagaggtata t 



41 



<210> 13 
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<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> an artificially synthesized primer sequence 

<400> 13 

tttggtgtta gaagagttaa g 21 



[Brief Description of Drawings] 

Figure 1 shows a schematic of various prior methylation detection pr 
ocesses, some of which are described in the Background section. The pres 
ent invention can also be added to this scheme as bisulfite based. 

Figure 2 shows a schematic of a preferred embodiment of the invent iv 
e process beginning with DNA (A) and being divided into two arms. The le 
ft arm first PCR amplifies the sample (B), then performs bisulfite conve 
rsion and then labeling with a second fluorescent probe (Cy3). The right 
arm first bisulfite converts the sample (C), then PCR amplifies, then 1 
abels with a first fluorescent probe (Cy5) and finally both samples (B+C 
) are hybridized onto a DNA microarray device for detection in a two-col 
or fluorescent imaging reaction. 

Figure 3 shows a microarray device layout pattern for studying a lar 
ge set of CpG island methylation sites in parallel. A minimum of two fea 
tures or microarray sites is required on the microarray device for each 
methylation site that is queried. One feature of the microarray device p 
robes for the presence of methylated cytosine and a second feature of th 
e microarray device probes for the presence of unmethylated cytosine in 
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a sample. 

Figure 4 shows results from a multiplexed two-color methylation dete 
ction assay performed on a microarray. Multiple methylation sites (CpG i 
slands) are detected at one time on the same microarray. The 'W positio 
n contains capture probe sequences for a methylated sample. The 'M' capt 
ure probe contains a guanosine residue at the cytosine position of the o 
riginal cytosine in CpG island and adenosine residues at all other cytos 
ine positions. The 'IT position contains capture probe sequences for an 
unmethylated sample. The 'IT capture probe contains adenosine residues a 
t all cytosine positions in the original sample. 

Figure 5 shows hybridization discrimination between a perfect ly-matc 
hed 15mer DNA target and single-mismatch 15mer DNA target hybridized und 
er high stringency conditions. Capture probes were designed such that th 
e single mismatch position shifted from the fifth position of the captur 
e probe to the eleventh position. Spot intensity ratio between match and 
mismatch samples indicates that maximal discrimination was obtained whe 
n the mismatch was positioned at the center of the capture probe. 

Figure 6 shows preparation of a reference sample for the sequence re 
gion being studied. The sample being tested serves as its own internal r 
eference control according to the inventive process. Amplification of DN 
A by the first PCR step strips all methylation information from the samp 
le. During PCR1, methylated and unmethylated cytosine residues produce u 
nmethylated cytosine residues in the amplicon. The amplicon undergoes bi 
sulfite conversion and cytosine residues are converted to deoxyuracil, w 
hich behave as thymidine residues in further enzymatic and annealing rea 
ctions. The bisulf ite-converted product undergoes a second PCR step to a 
dd an upstream transcriptional promoter. The final Cy3-labeld transcript 
is used as a reference sample that generates the unmethylated signal pa 
ttern on the microarray. 
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Figure 7 shows preparation of a test sample for the region being stu 
died. The sample is treated with sodium bisulfite to convert unmethylate 
d cytosine residues to deoxyuracil, which behave as thymidine residues i 
n further enzymatic and annealing reactions. Methylated cytosine residue 
s are protected during the conversion and retain their original cytosine 
structure. The bisulf ite-converted products undergo a PCR step to add a 
n upstream transcriptional promoter. The final Cy5-labeld transcript is 
used as the test sample to generate methylated or unmethylated signals o 
n the microarray that reflect the original methylation state of the samp 
le being tested. 
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Reference Sample preparation (B) 



Test Sample preparation (C) 
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Combine and hybridize on microarray (B+C) 
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[H5] 



Spot 
intensity 



0.15 
0.13 
0.11 

0.09 
0.07 
0.05 
0.03 
0.01 



0.8 



— » — Perfect match target 

■ Single mismatch target 
I I Match/mismatch ratio 



2.6 



1.8 



13 1.3 
1.1 



1.0 
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Mismatch position along 15-mer capture probe 



Effects of Mismatch Position Along Capture Probe 
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5 ' -tatccttctaggtag^ggcagtagccgcttc-3 ' M / U 

| PCR1 

5 ' - tatccttctaggtag |cg| ggcagtagccgcttc-3 u 

j, SBC 

5 ' -tatttttttaggtag§g|ggtagtagttgtttt-3 ' U 

^ PCR2 



T7 1 5 ' -tatttttttaggtag(tg]ggtagtagttgtttt-3 ' U 

^ Transcription 

5 ' -uauuuuuuuagguag|ug|gguaguaguuquuu- 3 ' Represents 

Unmethylated 
Cy3 labeled target Cytosine (U) 



A CpG dinucleotide with or without methylation 
A CpG dinucleotide without methylation 
SBC Sodium bisulfite conversion 
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[HI 71 

5 ' -tatccttctaggtag|ci|ggcagtagccgcttc-3 ' M / U 

I SBC 

5 ' -tatttttttaggtagjtglggtagtagttgtttt-3 ' U 

or 

5 1 -tatttttttaggtag{c^ggtagtagttgtttt-3 ' j^j 

| PCR2 

5 ' -tatttttttaggtag|tg|ggtagtagttgtttt-3 ' U 



T7 



or 



T7 



5 ' -tatttttttaggtag§g|ggtagtagttgtttt-3 ' j^f 

^ Transcription 

5 ■ -uauuuuuuuagguag^gguaguaguuguuu- 3 ' Represents Unmethylated 

Cytosine sample (U) 



or 



5 ' -uauuuuuuuagguagj^gguaguaguuguuu- 3 ' Represents Methylated 
Cy5 labeled targets Cytosine sample (M) 



A CpG dinucleotide with or without methylation 
SBC Sodium bisulfite conversion 
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[Abstract] 

There is disclosed a process for detection of DNA methylation at CpG 
sites using nucleic acid arrays and preferably microarrays. Specif icall 
y, there is disclosed a process for directly generating a reference samp 
le from the sample to be tested and detecting methylation at large numbe 
rs of CpG island sites simultaneously. More specifically, the inventive 
process comprises dividing a DNA sample into two samples (a first sample 
and a second sample), amplifying the first DNA sample by a nucleic acid 
amplification process such that any methylcytosine residues are amplifi 
ed as unmethylated cytosine residues, treating the amplified first sampl 
e and the (unampl if ied) second sample with bisulfite to convert unmethyl 
ated cytosine residues in both samples to deoxyuracil residues, labeling 

the bisulfite-converted second sample with a second fluorescent marker 
and the bisulfite-converted first sample with a first fluorescent marker 
, wherein the first and second fluorescent markers have non-overlapping 
fluorescent excitation and emission spectra; and hybridizing the first s 
ample and the second sample onto a microarray device having a plurality 
of oligonucleotide capture probes designed to hybridize to CpG island si 
tes of the DNA sample as converted and non-converted by bisulfite. 

[Representative Drawing] 
Figure 2 
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